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On a weakly Blackwell space we show how to define a Markov 
chain approximating problem, for the target problem. The approxi- 
mating problem is proved to converge to the optimal reduced problem 
under different pseudometrics. A computational example of compres- 
sion of information is discussed. 

Let X n be an homogeneous Markov chain. Suppose the process stops 
once it reaches an absorbing class, called the target, according to a given 
stopping rule: the resulting problem is called target problem (TP). The idea 
is to reduce the available information in order to only use the necessary 
information which is relevant with respect to the target. A new Markov 
chain, associated with a new equivalent but reduced matrix is defined. In 
the (large) finite case, the problem has been solved for TPs: in [1-3], it 
has been proved that any TP on a finite set of states has its "best target" 
equivalent Markov chain. Moreover, this chain is unique and there exists a 
polynomial time algorithm to reach this optimum. 

The question is now to find, in generality, an e-approximation of the 
Markov problem when the state space is measurable. The idea is to merge 
into one group the points that e-behave the same with respect to the ob- 
jective, but also in order to keep an almost equivalent Markov chain, with 
respect to the other "groups". The construction of these groups is done 
through equivalence relations and hence each group corresponds to a class 
of equivalence. In fact, there are many other mathematical fields where ap- 
proximation problems are faced by equivalences. For instance, in integration 
theory, we use simple functions, in functional analysis, we use the density of 
countable generated subspaces and in numerical analysis, we use the finite 
elements method. 

In this paper, the approximation is made by means of discrete equiva- 
lences, which will be defined in the following. The purpose of any approxi- 
mation is to reach the exact solution when e — ► 0. We prove that the sequence 
of approximations tends to the optimal exact equivalence relation defined in 
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[1-3], when we refine the groups. Finer equivalence will imply better approx- 
imation, and accordingly the limit will be defined as a countably generated 
equivalence. 

Under a very general Blackwell type hypothesis on the measurable space, 
we show that it is equivalent to speak on countably generated equivalence re- 
lationships or on measurable real functions on the measurable space of states. 
If we do not work under this framework of Blackwell spaces, we can be faced 
to paradoxes, as it is explained by [6], of enlarging cr-algebras, while decreas- 
ing the information available to a decision- maker. The e-approximation of 
the Markov chain depends always upon the kind of objective. In [9], Jer- 
rum deals with ergodic Markov chains. His objective is to approximate the 
stationary distribution by means of a discrete approximating Markov chain, 
whose limit distribution is close in a certain sense to the original one. How- 
ever, unlike our following work, his purpose is not the explicit and unified 
construction of the approximating process. In this paper, we focus on the 
target problem. We solve extensively the TP, where the objective is con- 
nected with the conditional probability of reaching the target T, namely 
IP(A n £ T\Xq = x), for any n,x. This part extends the work in [1—3], since 
TPs' approximation may help to understand the behavior of those TPs 
where the best equivalent Markov chain is also very large. The setting of an 
approximating problem can be extended to a general form, but we will not 
develop it in this paper. 

1. Main results. Let (X, X) be a measurable space. We equip it with 
an Assumption (AO) that will be explained when required. Let P be any 
transition probability on (X,X). An homogeneous Markov process (X n ) n is 
naturally associated to (X,X,P). In the target problem, we are interested 
in the probabilities of reaching the target class T within n steps, namely in 

P({A n € T}|Ao = x) for any n and x. 

The set T is a priori given and does not change through the computations. 

Definition 1. Let {X,X) be a measurable space and let T e X. Let 
T C X be a sub cr-algebra of X such that T € T. A function P : X x T —> 
[0, 1] is a transition probability on (X, J-) if 

• P(x,-) is a probability measure on T, for any x £ X; 

• P(-,F) is ^-"-measurable, for any F € J 7 . 

Given a transition probability P on (A, T) , we denote by P n the transition 
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probability on (X, T) given inductively by 

pl = p. P n+1 (x,F)= f P(x,dy)P n (y,F) 

Jx 

We denote by TrP(X, X, F) the set of the transition probabilities on (X, T). 
We denote by TP^ — UjrcA'TrP(X, X, J-) the set of all transition probabil- 
ities on X, that we equip with a suitable pseudometric d 

d{P l , P 2 ) = sup P n I p i(x, T) - P 2 >, T) | . 

x n 

It is such that 

d(P 1 ,P 2 ) = Pf (re, T) = P 2 "(x, T), for any n and x. 

This pseudo-metric is obviously compatible with the target T and appears 
as a loss function of using P 2 instead of Pi, whatever is the initial point x 
{(3 & (0, 1) is a discount rate). The point will be, with no surprise, to let this 
distance go to 0. 

A target problem is defined through a transition probability P 6 (T¥x,d). 

Definition 2. A target problem is a quadruple (X, F, T, P), where P € 
TrP(X, X , JF) and TG J. A simple target problem is a target problem where 
J 7 is generated by an at most countable partition of X. 

The main purpose of this paper is to approximate any target problem 
with a sequence of simple target problems in the spirit of the construction 
the Lebesgue integral, where the integral of a function / is approximated by 
the integral of simple functions f n = J2i Cild- The Lebesgue approximation 
requires at each step n£N two choices: the choice of the subdivision (Cj)j 
and the choice of the function values (cj)j on each subdivision. 

Definition 3. We call strategy Str a sequence of maps (Str n ) n from the 
set of the target problems to the set of the simple target problems. 

In the "Lebesgue example" given above, the strategy is related to the 
"objective" of the problem (the integral) and the pseudometric d(f, f n ) = 
J \ f — fn\dx is required to go to as n goes to infinity. Here also, a strategy 
is meaningful if d(P, Str n (P)) tends to as n goes to infinity. Moreover, 
for what concerns applications, given a target problem (X, X, T, P) a good 
strategy should not need the computation of P n , n > 1. The first main 
result of this paper is the existence of a class of good strategies, called target 
algorithms. 
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Theorem 1. For any target problem (X, T, P) and any target algo- 
rithm Str, 

lim d(P,P n )=0, 

n— >oo 

where (X,T n ,T,P n ) = Str n (X, T, T, P). 

Two questions immediately arise: does the sequence (Str„(P)) n have a 
limit (and in which sense)? Moreover, since d is defined as a pseudometric, 
does this limit depend on the choice of Str? 

The extension of the concept of compatible projection given in [1-3] to our 
framework will enable us to understand better the answer to these questions. 
A measurable set A ^ of a measurable space (X, X) is an ^-atom if it has 
no non-empty measurable proper subset. No two distinct atoms intersect. If 
the cr-field is countably generated, say by the sequence {A n } then the atoms 
of X are of the form O n C n where each C n is either A n or X \ A n . 

Definition 4. An equivalence relationship tt on a measurable set (X, X) 
is measurable (discrete) if there exists a (discrete) random variable / : 
(X, X) — > (M, £>r) (<Br denotes the Borel cr-algebra), such that 

xiry f( x ) = f(y), 

and we denote it by tt = ttj. Let (X, T, T, P) be a target problem. A com- 
patible projection is a measurable equivalency iTf such that 

(1) P(x,F) = P(y,F), Vxir f y,VF€*(f). 

A compatible projection tt is said to be optimal if tt 2 for any other 
compatible projection tt' . 

Remark 1. This definition is well posed if 

Assumption (AO) ensures that the definition of measurable equivalency is 
indeed well posed. This assumption will be stated and discussed in Section 
4. 

Theorem 2. If tt = tt j is a compatible projection for the target problem 
(X,J-,T,P), then there exists a target problem (X,a(f),T, P n ). such that 
P«(x,F) = P{x,F) for any F € a{f). 
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It is not said "a priori" that an optimal compatible projection must exist. 
If it is the case, then this equivalence is obviously unique. 

Theorem 3. For any target problem (X, J 7 , T, P), there exists a (unique) 
optimal compatible projection ir. 

To conclude the main results, let us first come back to the Lebesgue ex- 
ample. The simple function f n = J2i c i^Ci are chosen so that a(C n ) increases 
to <t(/) and f n (x) —> f(x). The following theorem guarantees these two facts 
by showing the "convergence" of any strategy to the optimal problem. 

Theorem 4. Let Str„(X, T, T, P) = (X, T n ,T, P n ), with Str target algo- 
rithm and let ir be the optimal compatible projection associated to the target 
problem (X, J 7 , T, P) . Then 

• F n Q Fn+i for any n, and y n T n = 

• lim n P n (x, F) = P n (x, F), for any F £ U m f m . 

Remark 2 (The topology Top). In Theorems 1-4, we have proved the 
convergence of (P n ) n to P % with respect to the pseudometric d. The pseu- 
dometric topology Top is the topology induced by the open balls B r (P) = 
{Q £ TFx ■ d(P, Q) < r}, which form a basis for the topology. Accordingly, 
the previous theorems may be reread in terms of convergence of P n to P on 
the topological space (TPx 5 Top). 

1.1. Connection with weak convergence. Given a strategy (X, J- n ,T, P n ) n . 
if we want to show a sort of weak convergence of P n (x, •) to P(x, •), for any 
x, we face the two following problems 

• each P n (x, ■) is defined on a different domain (namely, on J- n ); 

• we did not have required a topology on X. 

First, we want to introduce a new definition of probability convergence which 
takes into account the first restriction. The idea is given in the following 
example. 

Example 1. Let T n = a({(i2~ n ,(i + l)2' n ],i = 0, ...,2 n - 1}) be 
the cr-algebra on (0, 1] generated by the dyadic subdivision. Suppose we 
know that u n : T n — > [0, 1] is the unique probability on T n s.t. for any i, 
u n ((i2~ n , (i + l)2~ n ]) = 2~ n . Even if v n is not defined on the Borel sets of 
(0, 1], it is clear that in "some" sense, it must happen that v n — ► v*, where 
is the Lebesgue measure on the Borel sets of (0, 1]. Note that the cumulative 
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function of v n is not denned, and therefore a standard weak convergence 
cannot be verified. 
In fact, we know that 



( 2 ) ^((-°°>^])=^(( >^])=^ 



i.e., in this case, as n — ► oo, we can determine the cumulative function in a 
dense subset. This fact allows to hope that v n — > in a particular sense. 

Definition 5. Let (X,X,(X n ) n ) be a filtered space, and set Xoo = 
V ' n X n . Let i> n : X n — ► [0, 1] , n > 1 and : — > [0,1] be probability 
measures. We say that v n converges totally to on the topological space 
(X, t) as n tends to infinity, if v n ——> (converges in weak sense on (X, r)), 

T 

for any 9 n : X^ — ► [0,1], such that v n \ Xn = v n- We write v n z^oq. 

Going back to the example, it is simple to check that v n u*. where 

t(0,1] 

u n , are given in Example 1 and r(0, 1] is the standard topology on (0, 1]. 
In fact, let (P n ) n be any extension of {y n )n to the Borel sets of (0,1]. For 
any t € (0, 1), we have by (2) that 

t < F D (t) <t-\ , 

where Fp n is the cumulative function of i> n , which implies the weak conver- 
gence of v n to z/* and, therefore, v n z/*. 

t(0,1] 

For what concerns the topology on X, we will define the topological space 
(X, qp) induced by the pseudometric dp associated to the target problem 
(X, J 7 , T, P), and the pseudometric d. In this way qp is defined only with the 
data of the problem. One may ask: is this topology too poor? The answer is 
no, since it is defined by the interesting pseudometric dp. In fact, dp(x, y) < 
e means that x and y play "almost the same role" with respect to T. A 
direct algorithm which takes dp into account needs the computation of P n 
at each step. In any case, even if dp may not be computable, it defines a 
nontrivial interesting topology qp on X. As expected, we have the following 
theorem. 

Theorem 5. Let Str n (X, J 7 , T, P) = (X,F n ,T,P n ), with Str target al- 
gorithm. Then 

P n ^P. 
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2. The target algorithm. In this section, we introduce the core of the 
approximating target problem, namely a set of strategies Str which solves 
the target problem. 

Given a measurable space (X,X) and a target problem (X, T, T, P), the 
target algorithm is built in the spirit of the exact one given in [1, 2], which 
starts from the largest classes T and X \ T and then reaches the optimal 
classes according to a backward construction. 

The target algorithm defines a strategy Str = (Str n ) n , where 

St % (X, F,T,P) = {X,f n ,T,P n ), 

and it consists of three steps: 

1. the choice of a sequence (~ en ) n of equivalences on the simplex on the 
unit ball of i\ with e n — > 0; 

2. the definition of a filtration {J- n ) n based on (~ en ) n where each T n is 
generated by a countable partition of X; 

3. the choice of a suitable measure /i and the definition of (P n )n- 

2.1. Preliminary results on measurability and equivalency, and the choice 
of (~e n )n- Associated to each countably generated sub cr-algebra A C X, 
we define the equivalence relationship ir A induced by the atoms of ^4: 

xir A y <==^ [x] A := (l{A G A: x G A} = C\{A € A: y G A} =: [y] A . 

Thus, if (A n ) n is a sequence of countably generated u-algebras, then 

( 3 ) A* = n n 7T4„ . 

Now, the atoms of the cr-algebra T of each simple target problem (X, T, Q) 
are at most countable, by definition. Then Q may be represented as a transi- 
tion matrix on the state set N. Each row of Q is a distribution probability on 
N (i.e. a sequence (p n ) n in the simplex S of The first step of the target 
algorithm is to equip S with the £i-norm and then to define an e-equivalence 
on S. 

We will alternatively use both the discrete equivalencies and the countable 
measurable partitions, as a consequence of the following result, whose proof 
is left to appendix. 

Lemma 6. Given a measurable space (X,X), there exists a natural bi- 
section between the set of discrete equivalencies on X and the set of the 
countable measurable partitions of it. 
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Let B £l (0, 1) be the unit ball in l x and S = {x > 0} n B h (0, 1) be the 
simplex on l\. Let Q n = [0, 1], for any n, and r be the standard topology on 
[0, 1]. Denote by #[o,i] the Borel cr-algebra on [0, 1] generated by r. We look 
at S as a subset of Tl^ =1 £l n so that the Borel u-algebra B$ induced on S is 

®- 1 % 1] ns. 



Definition 6. 
on (S, Bs) and \\p 



~ e is an e-equivalence on S if it is a discrete equivalence 
g||i < e whenever p ~ e q. 



Remark 3. The choice of £i-norm on S is linked to the total variation 
distance between probability measures. The total variation distance between 
two probability measures P and Q is defined by cItv(P, Q) = sup^gQ \P(A) — 
Q(A)\. Now the total variation of a measure [i is ||/i||(f2) = sup^i IM^i)!' 
where the supremum is taken over all the possible partitions of f2. As (P — 
Q)(tt) = 0, we have that d T v(P,Q) = \\\P ~ Q\\, see [4]. To each p G S 
corresponds the probability measure P on N with P(i) = pi (and viceversa). 
Therefore, since \\p — q\\\ = \\P — Q\\ = 2cItv(P, Q), we have 



P 



q^d TV (P,Q) < e/2 



Pn 

e2~ r > 



c2 



Example 2. Define the e-cut as follows, p ~ e q <J=^> 
%7rJ , Vn, where |_a;J denotes the entire part of x. Then ~ e is an e-equivalence 



on 5. Indeed, 



S/ ~ e is at most countable (since we divide each [0, 1] into classes of 
length e2~ n ). 
For any p E S 



M ={S€S:7T~.( 9 ) =*„.(?)} =11 



rl^J' ( 



2"Pn 



+ 1 e 



2" 



is measurable with respect to Bs- 
Vp ~ e q, 

\\p-q\\ 1 <J2e2~ n = e. 



2.2. The choice of (F n )n- Given a sequence (~ e „)neN of e-equivalences 
on S, we define the choice of {J- n ) n inductively. This algorithm is a good 
candidate to be a strategy for the approximating problem we are facing and 
it is based on this idea: consider the equivalence classes given by T n -\ and 
divide them again according to the following rule. Starting from any two 
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points in the same class, we check whether the probabilities to attain any 
other J-" n _i-classes are e-the same. Mathematically speaking: 
Step : Tq = a(T) = {0, T,X\T, X} 

Step n : T n is based on the equivalence T n -\ and on ~ en , inductively. 
T n -\ is generated by a countable partition of X, say (A^ n We define, 
for any couple (x, y) 6 X 2 , 

(4) (xvr n y) (xvr„_ iy ) A ((P(x,4^) 4 ) (P(y, A^)*)) ■ 

The following Lemma 7 shows that 7r n is a discrete equivalency on (X, X), 
and therefore it defines T n = a(X/ir n ) as generated by a countable partitions 
of X. 

Remark 4. In applications, {J- n )n must be finitely generated. This is 
not a big restriction. In fact one can prove inductively that this is always the 
case if the projection ~ £n divides each component of S into a finite number 
of subsets, as in Example 2. The choice of the "optimal" sequence (~ e „)n 
is not the scope of this work. We only note that the definition of ~ £ can 
be relaxed and the choice of the sequence (~ en ) n may be done interactively, 
obtaining a fewer number of classes (A™ )j at each step. 

Lemma 7. (J- n )n is a filtration on (X,J-). Moreover, for any n € N, ir n 
is a discrete equivalency on (X, X) . 

Proof. The monotonicity of (J- n ) n is a simple consequence of (4). 

The statement is true for n = 0, since T 6 X. For the induction step, let 
{A^ 1 1 \Aq^ , . . .} G X be the measurable countable partition of X given 
by X/7r n _i. The map h : (X,X) -» (S,B(S)) given by x i-> {P(x ) Af~ 1) )) i 
is therefore measurable. As ~ £n is a discrete equivalency on (S, £>s), the map 
7r^ en o /i : (X, A?) — > (5/~ e „ , 2 s '/~ e ™ ) is also measurable, where 7r^ en is the 
natural projection associated with ~ en . Thus, two points x,y £ X are such 
that 

if and only if their image by Tt~ en o h is the same point of <S/~ en . The 
new partition of X built by 7r n is thus obtained as an intersection of the 
sets A^ n , i > 1 — which formed the 7r n _i-partition — with the counter- 
images of Sy~ £n by 7r^ £n oh. Intersections between two measurable countable 
partitions of X being a measurable countable partition of X, we are done. 

□ 
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2.3. The choice of [i and the definition of (P n ) n . Before defining (P n ) n , 
we need the following result, which will be proved in Section 5. 

Theorem 8. Let (7r n ) n be defined as in the previous section and let 
^oo = n„7r n . Then tt^ is a compatible projection. 

As a consequence of Theorem 2 and of Theorem 8, a target problem 
(X, VnJ^njT, Poo) is well defined. We intend to define P n as the /i-weighted 
mean average of Poo given the information carried by T n . 

More precisely, let \x be a probability measure on (X, \/ n ^F n ) such that 
fj.(F) > 0, for any F G J- n , (the existence of such a measure is shown 

in Example 3). 

For any F G !F n , let Y F be the V^^-random variable such that Y f (oj) = 
Poo(uj,F). Define 

(5) P n (x,F) = E u [Y F \F n ](x), VxGX,VFG^ n . 

P n is uniquely defined on (X x J- n ), the only ^i-null set of T n being the 
empty set. Then we can ensure that P n (x, •) is a probability measure, for 
any x G X. 

We give in the following an example of the measure \x that has been used 
in Equation (5) which justifies its existence. 

Example 3. Let (Y n ) n >o be a sequence of independent and identically 
distributed geometric random variables, with PyC?) = l/2- ? ,j G N. Let 
A n = o-(Yq, ■ ■ ■ ,Y n ) and set A = \ZA n - There exists a probability measure 
P on A such that 

P(nto{^ = Vi}) = w Yh (yi) ® • • • ® Py in (y n ) = v ! , 

and thus, ¥(A) > 0, \/A G An, A ^ 0. Moreover, it follows that for any n, 

(6) A 1 eA n ,A 2 ea(Y n+1 ),A 1 ^0,A 2 ^0, => F(A 1 nA 2 )>0. 

We check by induction that we can embed T n into A n , for any n > 0. The 
searched measure /i will be the trace of P on the embedded cr-field V„J„. 

For n = 0, define T i— > {Yq = 1}, X\T i— » {Yo > 2}. The embedding forms 
a nontrivial partition, and therefore the restriction of P to the embedding 
of JF defines a probability measure on Tq with Ho{F) > if F / 0. 



G. ALETTI, D. SAADA/TARGET PROBLEMS IN BLACKWELL SPACES 11 

(n) 

For the induction step, suppose it is true for n. Given F£ £ J- n , we then 
have F^ i— > j4^"\ where )j is a nontrivial partition in A n and therefore 
the restriction of P to the embedding of T n defines a probability measure 
fj, n on T n with n n (F) > if F ^ 0. 

Given F 4 (n) , let ^ n+1) := {if 1 +1) : Fj n+1) C F- (n) }. The monotonicity 
of 7r n ensures that each F^ + will belong to one and only one F[\ n+l \ 
Moreover, by definition of if 1 , we have that 

(7) F 4 (n) = U{Ff +1) : F$ n +1) G tff n+1) }. 

Since X/-7r n+1 is at most countable, we may order H\ n+1 ^ for any i. We have 
accordingly defined an injective map X/Tr n+ i — > N 2 , where 

F ( n +V ^ _p ? (™+ 1 ) is t he Jfe-th element in 

According to the cardinality of H\ n+l \ define the n + 1-embedding 

3 ~^" A i n \ { Y n+1 >k} ifk = #{Ht +1) } 

By definition of A^ 1 ^ and (7), it follows that we have mapped J- n +i into 

a partition in An+i- Moreover, P(^ n+1) ) > as a consequence of (6). The 
restriction of P to the embedding of J- n +\ defines a probability measure on 
jF n+ i with fi n+ i(F) > if F ^ 0. Note that ii n +i is by construction an 
extension of fi n to ^" n +i- 

Finally, the extension Theorem ensures the existence of the required /x, 
which is just mapped to the trace of P on the embedded J-oo- 

3. Numerical Discrete Example. 

Example 4 (Coupon Collector). Let n objects {ei,...,e n } be picked 
repeatedly with probability pi that object is picked on a given try, with 
J2iPi = 1- Find the earliest time at which all n objects have been picked at 
least once. 



It is not difficult to show that the general Coupon Collector's Problem 
may be embedded into a Markow network of N = 2 n — 1-nodes (see, [1]). 
Thus, let P be a N x iV-transition matrix on the state set X = {!,..., N}. 
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For any X/tt = {xi, . . . ,x n } (n < N), we define the N x ra-matrix 

Qij ~ 



1, if i G xj; 
0, otherwise. 



Then P. = PQ is a nonnegative N x n-matrix. It is a transition probability 
matrix from X to X/tt (we called it P(x,Aj )). Each row i represents the 
restriction of P(i, •) to {-7r _1 (xj),j = l,...,n}. As noted in the proof of 
Theorem 1, we should choose a probability measure [i on X and then define 
a new matrix P on a(X/n) with (5). A "neutral" choice for \x is = 1/iV. 
Accordingly, by (5), for any i,j G {1, . . . , n}, 



#{f : ^(0exi} /y vr(0 G x;}' 

«:7r(Z)GXi AT Z:7r(0ex i 7 ' L w 1J 

A simple computation gives 

P = {Q T QY 1 Q T PQ 

where (Q T Q)~ 1 = diag( #{f: n \ l)£xi} , #{i: ^ )exa} 1 • • • > 

We have tested two target algorithms on a coupon collector problem with 
n = 18 objects. In this case P is given by a 2 18 x 2 18 -sparse matrix. The 
computation of P n is not practicable. The number of components of each 
class of X/ir m is plotted for m = 0,1,2,3 in the following Figure 1 and 
Figure 2. The sequence (X/ir m ) m varies according to the target algorithm. 
We remark that both strategies converge to the same exact solution. 

4. Blackwell. The problem of approximation is mathematically differ- 
ent if we start from a Markov process with a countable set of states or with 
an uncountable one. Let us consider, for the moment, the countable case: X 
is the at most countable set of the states and X = 2 X is the power set. Each 
function on X is measurable. If we take any equivalence relation on X, it is 
both measurable and identified by the cr-algebra it induces (see Theorem 11 
below). This is not in general the case when we deal with a measurable space 
(X, X), with X uncountable. In this section, we want to connect the process 
of approximation with the upgrading information. More precisely, a mea- 
surable equivalence ir = ttj defines both the partition X/tt and the sigma 
algebra cr(/). One wishes these two objects to be related, in the sense that 
ordering should be preserved. Example 7 below shows a paradox concerning 
TTj and <j(f) when X is uncountable. In fact, 
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X/n = {T,X\ T} 




X M = {T = A« A«,A«} 



X/V 3 = {T = A< 3) ,...,Ag ) } 



1 2 3 4 5 6 7 8 9 10 11 12 13 



5 10 15 20 25 30 



Fig 1. Number of states for each class of X/ir m (log scale), with e\ = 0.5, ei = 0.1, 
£3 = 0.05. The choice o/(~ £ „)„ is made as in Example 2. 



X/tto = {T,X\T} 



X/n 2 = {T = A?\...,A<g} 



1 2 3 4 5 6 7 



xm = {t = a( 1 UP',4 1 '} 



x/V 3 = {t = a< 3) ,...,a< 1 3 1 ) } 



5 10 15 20 25 30 



Fig 2. Number of states for each class of X/nm (log scale) with ei = 0.5, ei = 0.1, 
£3 = 0.05, with a different choice of the target algorithm. 



G. ALETTI, D. SAADA/TARGET PROBLEMS IN BLACKWELL SPACES 14 



Lemma 9. Let A\ C A2 be countably generated sub a-algebras of a mea- 
surable space (X,X). Then [x]^ 2 [%]a2- 

In particular, let f,g be random variables. If o~(f) 2 o~(g), then nf C 7r g . 



The problem is that even if a partition is more informative than another 
one, it is not true that it generates a finer cr-algebra, i.e., the following 
implication is not always true for any couple of random variables / and g 



Then Lemma 9 is not invertible, if we do not require the further Assump- 
tion (AO) on the measurable space (X, X). This last fact connects the space 
(X, X) with the theory of Blackwell spaces (see Lemma 10). We will assume 
the sole Assumption (AO). 

Example 5 ("7T/ = ir g =fr cr(f) = o~(g)). We give here a counterexample 
to Assumption (AO), where two random variables f,g generate two differ- 
ent sigma algebras o~(f) 7^ cr(g) with the same set of atoms. Obviously, 
Assumption (AO) does not hold. 

Let (X, Bx) be a Polish space and suppose Bx £ X. Let A £ X \ Bx and 
consider the sequence {A n , n € N} that determines Bx, he. Bx = cr(A n , n 6 
N). Let A = a(A,A n ,n € N). Bx 5 A. As a consequence of Lemma A. 3, 
there exist two random variables f,g such that Bx = &{f) and A = o~{g). 
The atoms of Bx are the points of X, and then the atoms of A are also the 
points of X, since Bx Q A. 

We recall here the definition of Blackwell spaces. A measurable space 
(X, X) is said Blackwell if X is a countably generated <7-algebra of X and 
A = X whenever A is another countably generated cr-algebra of X such that 
A C X, and A has the same atoms as X. A metric space X is Blackwell 
if, when endowed with its Borel cr-algebra, it is Blackwell. The measurable 
space (X, X) is said to be a strongly Blackwell space if X is a countably 
generated cr-algebra of X and 

(Al) A\ = A2 if and only if the sets of their atoms coincide, where Ai and 
A2 are countably generated cr-algebras with A% C X i = 1, 2. 

For what concerns Blackwell spaces, the literature is quite extensive. 
D. Blackwell proved that every analytic subset of a Polish space is, with re- 
spect to its relative Borel cr-field, a strongly Blackwell space (see [5]). There- 
fore, if (X,Bx) is (an analytic subset of) a Polish space and Bx £ X, then 



Proof. See Appendix A. 



□ 



(AO) 
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(X, X) cannot be a weakly Blackwell space. To see this, take A\, A 2 . . . a base 
of B x and A G X \ B X - Then B x = a(A 1 ,A 2 . . .) and A = a(A, A U A 2 ...) 
have the same set of atoms (the points of X) but A ^ Bx (or, equivalently, 
the identity function : (X,Bx) — ► {X,A) is not measurable). Moreover, 
as any (at most) countable set equipped with any cr-algebra may be seen as 
an analytic subset of a Polish space, then it is a strongly Blackwell space. 

A. Maitra exhibited coanalytic sets that are not Blackwell spaces (see 
[10]). M. Orkin constructed a nonanalytic (in fact nonmeasurable) set in a 
Polish space that is a Blackwell space (see [11]). Jasihski showed (see [7]) 
that continuum hypothesis (CH) implies that there exist uncountable Sier- 
pihski and Luzin subsets of M. which are Blackwell spaces (implying in a 
strong way that Blackwell spaces do not have to be Lebesgue measurable or 
have the Baire property). Jasihski also showed that CH implies that there 
exist uncountable Sierpihski and Luzin subsets of M which are not Blackwell 
spaces (implying in a strong way that Lebesgue measurable sets and sets 
with the Baire property do not have to be Blackwell spaces). This latter 
result is strengthened by R.M. Shortt in [13] by showing that CH implies 
the existence of uncountable Sierpihski and Luzin subsets of M which are 
highly non-Blackwell in the sense that all Blackwell subspaces of the two 
sets are countable. 

Note that Assumption (AO) and Assumption (Al) coincide, as the follow- 
ing Lemma states. 

Lemma 10. Let (X, X) be a measurable space. Then (AO) holds if and 
only if (Al) holds. 

Proof. Lemma A. 3 in appendix states that A C X is countably gener- 
ated if and only if there exists a random variable / such that A = o~(f). In 
addition, as a consequence of Lemma 9, we have only to prove that (Al) 
implies (AO). By contradiction, assume (Al), ttj C 7r g , but a{g) ^ cr(f). We 
have cr(f,g) ^ &(f), and then 7To-(/, 5 ) ^ vrj by (Al) and Lemma A. 3. On the 
other hand, as a consequence of Eq. (3), we have that ^ a (f,g) = 7r <j(/)vo-(g) = 

TTf C\TT g = TTf. □ 

We call weakly Blackwell space a measurable space (X, X) such that As- 
sumption (AO) holds. If (X, X) is a weakly Blackwell space, then (X, T) is 
a weakly Blackwell space, for any T C X . Moreover, every strong Blackwell 
space is both a Blackwell space and a weakly Blackwell space whilst the other 
inclusions are not generally true. In [8, 12], examples are provided of Black- 
well spaces which may be shown not to be weakly Blackwell. The following 
example shows that a weakly Blackwell space need not be Blackwell. 
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Example 6 (weakly Blackwell =fr Blackwell). Let X be an uncountable 
set and X be the countable-cocountable cr-algebra on X. X is easily shown 
to be not countably generated, and therefore (X, X) is not a Blackwell space. 
Take any countably generated u-field A C X, i.e. A = a({Ai, i 6 N}). 

• Since each set (or its complementary) of X is countable, then, with- 
out loss of generality, we can assume the cardinality of X \ Ai to be 
countable. 

• Each atom B of a(Ai,i £ N) is of the form 

(8) B = (11=1,2,. ..Cj, where Q = Ai or Ci = X \ Ai, for any i. 

Note that the cardinality of the set A := Ui(X \ Ai) is countable, as it is 
a countable union of countable sets. As a consequence of (8), we face two 
types of atoms: 

1. for any i, Cj = Ai. This is the atom made by the intersections of all 
the uncountable generators. This is an uncountable atom, as it is equal 
to X\A. 

2. exists i such that Cj = X \ Ai. This implies that this atom is a subset 
of the countable set A. Therefore, all the atoms (except X \ A) are 
disjoint subsets of the countable set A and hence they are countable. 

It follows that the number of atoms of A is at most countable. Thus, (X, A) 
is a strongly Blackwell space, i.e. (X, X) is a weakly Blackwell space. 

Example 7 (Information and cr-algebra (see [6])). Suppose X = [0,1], 
X = cr(y, A) where y is the countable-cocountable cr-algebra on X and 
A = [0,1/2). Consider a decisionmaker who chooses action 1 if x < 5 and 
action 2 if x > ^. Suppose now that the information is modeled either as 
the partition of all elements of X, t = {x,x £ X} and in this case the 
decisionmaker is perfectly informed, or as the partition r' = {A, X \ A}. 
If we deal with cr-algebras as a model of information then <t(t) = y and 
c(r') = cr(A). The partition r is more informative than r', whereas <r(r) 
is not finer than c(r'). In fact A tfL y and therefore if the decisionmaker 
uses ct(t) as its structure of information, believing it more detailed than 
o"(t'), he will never know whether or not the event A has occurred and can 
be led to take the wrong decision. In this case, <r-algebras do not preserve 
information because they are not closed under arbitrary unions. However, 
if we deal with Blackwell spaces, any countable u-algebra is identified by 
its atoms and therefore will possess an informational content (see [14], for 
example) . 
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The following theorem, whose proof is in Appendix B, links the measura- 
bility of any relation with the cardinality of the space and Assumption (AO). 
It shows the main difference between the uncountable case and the countable 
one. 

Theorem 11. Assume (CH). Let (X,X) be a measurable space. The 
following properties are equivalent: 

1. Any equivalence relation ir on X is measurable and Assumption (AO) 
holds; 

2. {X,2 X ) is a weakly Blackwell space; 

3. X is countable and X = 2 . 

5. Proofs. The following theorem mathematically motivates our ap- 
proximation problem: any limit of a monotone sequence of discrete equiva- 
lence relationships is a measurable equivalence. 

Theorem 12. For all n G N, let ir n be a discrete equivalency. Then 
^oo = n n 7r n is a measurable equivalency. Conversely, for any measurable 
equivalency tt, there exists a sequence (ir n ) n of discrete equivalencies such 
that TToo = n n 7r n . 

Proof. See Appendix A. □ 

Proof of Theorem 2. Let tt = ttj be a compatible projection. We de- 
fine 

P 7T (x,F):=P(x,F), VxG X,VF€a(f). 

What remains to prove is that P n G TiP(X, X ,a(f)). More precisely, we 
have to show that P n (-,F) is <r(/)-measurable, VF € <r(/). By contradic- 
tion, there exists F G c(f) such that the random variable Y F {oS) = P^oj, F) 
is not <r(/)-measurable. Then a(Y F ) <r(/), and hence tt y f ^ ftf by As- 
sumption (AO), which contradicts Equation (1). □ 

Proof of Theorem 8. As a consequence of Theorem 12, tToo = irj, 
where a(f) = V 1 n T n . Define 

P 00 (x,F):=P(x,F), VxeX,VFea(/). 

We will prove that, for any F G cr(f), Poo(-,F) is er(/)-measurable and 
consequently will be a compatible projection. This implies that there 
exists a measurable function hp : (J$.,Br) — > (R,23r) so that P 00 (o;,i ;i ) = 
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hp(f(uj)). Therefore, if xtrjy, then P^x^F) = P OQ (y,F), which is the 
thesis. 

Thus, we show that for any F G cr(/) and t G M, we have 

(9) H:={x:P(x,F)<t}ea(f) 

To prove Equation (9), we first show that it is true when F G T n by proving 
that 

H = n m>n 7r~ 1 7r m (i?), 

which implies that H G cr(f). The inclusion H C n m 7r~ 1 7r m (i?) is always 
true. For the other inclusion, let y G n m>n 7r~ 7r m (iT). Let m > n; there 
exists x m G il such that y^mX m - Therefore, Equation (4) and the definition 
of ~ £n imply P(y, F) < P(x m , F) + e m <t + e m , for any m > n. As e m \ 0, 
we obtain that y G il. Then Equation (9) is true on the algebra Alg := Un^. 

Actually, let F n G j4/<7 such that F n f F. We prove that Equation (9) 
holds for F by showing that 

H={x: P{x,F) <t} = n n {x: P{x,F n ) < t} =: n n H n . 

Again, since F n C F, then P(x,F n ) < P(x,F) and therefore H C f] n H n . 
Conversely, the set r\ n H n \ H is empty since the sequence of A? -measurable 
maps P(-, F) — P(-, F n ) converges to 0: 

P(; F) - P(; F n ) = P(; F \ F n ) —> P(; 0) = 0. 

Then Equation (9) is true on the monotone class generated by the algebra 
Alg = Un^n, i.e., Equation (9) is true on a(f). □ 

Proof of Theorem 3. Given a target algorithm {X,T n ,T,P n ) n , let 
t^oo = 17 be defined as in Theorem 8. We show that ir^ is optimal. Let 
ip g be another compatible projection and let (X,a(g),T, P g ) be the target 
problem given by Theorem 2. We are going to prove by induction on n that 

(10) VnGN, T n Qo{g). 

In fact, for n = it is sufficient to note that J-q = a({T}) C <r(g). 

Equation (4) states that T n = a(J- n -i, h n ), where h n is the discrete ran- 
dom variable, given by Lemma 6, s.t. 

^TT hn y 
t 

((P(x,At\) (P(y,At%)) 
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Let fcj n_1) :X^[0, 1] be defined as k^~ 1] {x) = P(x,Af~ 1] ). Then 
X 



n (n—1) >. 



5 > 



Obviously, <r(h n ) C <r(A;j n , /4™ , • ■ •)• For the induction step, as A^ n ^ € 

•^n-i C cr(p), we have that P g (-,A[ n ^) is cr(<7)-measurable, and therefore 

°{k { r 1] ) Q <r(g). Then T n = a(T n ^h n ) C <7(^ n _i, ■ ■ ■) C 

(7(5). Therefore cr(f) = Vn^n Q &{9), which implies -k^ D ^ g by Lemma 9, 
and hence ir^ is optimal. □ 

Corollary 13. it^ does not depend on the choice of Str. 

Proof, ir^ = n n ir n is optimal, V(ir n )n = Str(P). The optimal projection 
being unique, we are done. □ 

Proof of Theorem 1. Let -Koo = itf be defined as in Theorem 8 and 
(X, a(f),T, Pqo) be given by Theorem 2 so that P(x, F) = Poo(x, F) for any 
F € cr(/). Then each (P n ) n of Definition 5 can be rewritten as 

(11) P n ( X ,F) = J ^ P ^\ FMdZ \ V x eX,VFEF nt 

where [x] n is the 7r n -class of equivalence of x and n([x] n ) > since [x] n ^ 0. 

Note that d(P, P m ) < 2J2 n & n - Then, for any e > 0, there exists an N so 
that J2 n >N /3 n < f • Therefore we are going to prove by induction on n that 

sup \ P^(x, T) — P n (x, T)\ — > as m tends to infinity, 

X 

which completes the proof. If n = 1, then by definition of e m , since T € 
Fm-i, we have that 

J M \Poo(z,T) - P(x,T)\n(dz) 
\P m (x,T)-P(x,T)\ < JMJ °° K . K ' )m 

JL, \P(z,T)-P(x,T)\fi(dz) 



/[«]„ _ 



Ml? 
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For the induction step, we note that 

(12) \P r n +\x,T)-P n+ \x,T)\ 

< £ \p m {x,4 m) )P»(4 m \T) - J P(x,dz)P n (z,T)\, 

i i 

where (A^ )j is the partition of X given by n m . By induction hypothesis, 

\P^(z,T)-P n (z,T)\<e 
for m > mo large enough. Since [z] m = A™ if z € ^4j- m \ it follows that 



J im) P(x,dz)\PZ(A^\T)-P n (z,T)\ <ej P(x,dz). 

Equation (12) becomes 
\P« + \x,T)-P n+1 (x,T)\ 

< e + £ K.{ A (m) , r) | p m (x, A (m) ) - P{x, At ] ) I 

i 

<e + ^\P n (x,4 m) )-P(x,4 m) )\. 

i 

On the other hand, by Equation (11), 

P m (x,At ] ) - P(x,At ] ) = I P ^ A ^- P ^ A(r) \ (dz). 

J[x] m MfW) 

The definition of ~ £m+1 states that 

i 

whenever z€ [x] m and therefore 
\P^ +1 (x,T)-P n+1 (x,T)\ 

{m) ,j4dz) 



J\x\m .' 



< e + e 



m+l- 



MM 

Since e m — > as m tends to infinity, we get the result. □ 
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Proof of Theorem 4. By Definition 5 and Lemma 7, {P n (-,F)) n > m is 
a martingale with respect to the filtration (F n )n>m, for any F G T m . Then, 
if Y F (x) = P(x,F) as in Definition 5, we have that 

P n (x,F) — » E„[Y F \V n f n ](x) = Y F (x), for /i-a.e. x G X,VF G U m ^ m . 

n— >oo 

Let 7TQO = 7rj be defined as in Theorem 8 and (X,a(f),T, P^) given by 
Theorem 2 so that P(x,F) = P OQ (x,F) for any F G cr(f). Then 

P n (x,F) — ► Poo(x,F) 

n— »oo 

for any x G X and F G U m .F m , the only /i-null set in U m f ra being the empty 
set. □ 

5.1. Weak convergence of conditional probabilities. Let the target prob- 
lem (X,F,T,P) be given and let Str = (Str n ) n , where Str n (X, J 7 , T, P) = 
(X,jF n ,T,P n ) be a target algorithm. In order to prove Theorem 5, which 
states the total convergence of the probability measure P n (x,-) towards 
P(x,-), we proceed as follows: 

• first, we define the topology gp on X; 

• then, we define a "natural" topology rstr on X associated to any target 
algorithm (Str n ) n . We prove in Theorem 14 the total convergence of 
(P n )n to Pqo, under this topology; 

• then, we define the topology rp on X as the intersection of all the 
topologies T Str ; 

• finally, we show Theorem 5 by proving that qp C rstr- The non- 
triviality of Qp will imply that of Tp. 

We introduce the pseudometric dp on X as follows: 

d P (x,y) =Y,P n \P n (*,T) - P n (y,T)\. 

n 

Now, let Tstr be the topology generated by U n .F n . C is a closed set if and 
only if C = n n C n , C n G T n . In fact, if C G !F n , for a given n, then C G Fn+p, 
for any p and therefore C is closed. (X, rstr) is a topological space. 

Remark 5. Let us go back to Example 1. The topology defined by ask- 
ing that the sets in each T n are closed is strictly finer than the standard 
topology. On the other hand, the same example may be explained with left 
closed-right opened dyadic subdivisions, which leads to a different topol- 
ogy that also contains the natural one. Any other "reasonable" choice of 
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subdivision will show the same: the topologies are different, and all contain 
the standard one. In the same manner, we are going to show that all the 
topologies rstr contain the standard one, gp. 

Theorem 14. Let the target problem (X,J-,T,P) and the target algo- 
rithm (X, T n ,T, P n ) n be given. For any target algorithm Str, 

P n (x,-)^P(x,-), VxeX 

TStr 

Proof. Let C = r) n C n be a closed set of Str and let P n be any ex- 
tension of P n to \J n T n . We have to check that limsup n P n (x, C) < P(x,C), 
for any given x (see, e.g., [4]). Note that, since C G \Z n Fn, we have P(x, C) = 
Poo Or, C). But, P n (x, C)-Poo(x, C) < P n (x, C n -x)-Poo{x, C) = P n (x, C n _i)- 
Poo(x, C). Actually, 

P n (x, C n -\) — Poo(x, C) 

= ( Pn(g,gn-l)--Poo(g,Cn-l) ) + ( Poo (f, Cn-1 ) ~ Pqo (x, C) ) ■ 
I II 

I — > as n tends to infinity, from the target algorithm, since as e n \ 0. For 
any e > 0, there exists Ni > 0, such that for any n > N%, \P n (x, C n -\) — 
Poo{x, C n —i)\ < |. 

— > as n tends to infinity, from the continuity of the measure. For any 
e > 0, there exists ^2 > and for any n > N2, \Poo(x, C n _i) — Poo(x, C)| < 

!• □ 

An example of a natural extension of P n to P n is given by 

P n (x,F) = E^[Y F \ V n F n \(x), Vx G X, VP € V n T n , 

where, for any P € V n J- n , Y F is the V n .P n -random variable such that 
Y f (oj) = P 0O (a;, P). As mentioned for P n , P n (x, •) is a probability measure, 
for any x £ X. 

Corollary 15. For any fixed strategy Str(P), let P n be as in Theo- 
rem 1. We have 

P n (x,-)^>P(x,-), 

TP 

for any given x. 
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In order to describe the topology Tp, we will denote by [[F]]* the closure 
of a set F C X in a given topology *. Note that the monotonicity of 7r n 
implies 

[[F}]r StI = n n [[F]] TStin 

where rst r „ is the (discrete) topology on X generated by T n . Since Tp is the 
intersection of all the topologies rstn we have 

[[F]] TP 5 [[F]] TSti . = n n [[F]] rstrn , VP e 2 x ,VStr. 

PROOF of Theorem 5. Let F be the closed set in gp so defined 

F := {y £ X : d P {y,x) > r}, 

i.e., F is the complementary of an open ball in (X, dp) with center x and 
radius r. If we show that F £ Tp, then we are done, since the arbitrary 
choice of x and r spans a base for the topology gp. 
We are going to prove 

^=[[^]]r str =n m [[F]] TStrm , VStr, 

which implies [[F]] T p = F. It is always true that F C [[F]]*; we prove 
the nontrivial inclusion F D n m [[F]] rstr . Assume that y € [[F]] Tgtr . Now, 
y € [[-F]] Tstr , for any m, and then there exists a sequence (y m ) m with 
y m € F such that yn m y m , for any m. Thus, y € r) m [y m ] m , where [x] m is 
the 7r m -class of equivalence of x. Thus 

P^(y m ,T) = PZ(y,T), Vm,n 

since P m (-,T) is F m -measurable. By Theorem 1, for any n € N, 

|P re (y,T) - P^(y,T)| + \P^{y m ,T) - P n (y m ,T)\ ^0. 

Now, let TV" be such that J2^=n @ n < § and take no sufficiently large s.t. 

TV 

£ |P"(y,T) - P£(y,T)| + |P" o (y no ,T) - P"(y no ,T)| < | 

n=0 Z 
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We have 

dp(y no ,y) = £/? n |P n (y,T) - P n (y no ,T)\ 

n 

N oo 

<J2\P n (v,T)-P n (y no ,T)\ +2]T/r 

n=0 n=N 

N 

< £ (|P™(y,T) -P«( y ,T)| + IP- (y,r) - 1» (y no ,T)\ 

n=0 

+ |P n >„ ,T) - P n (y no ,T)\) +2 € -< € -+ e - = e. 

and therefore 

dp{x,y) > d P (x,y no ) - d P (y no ,y) > r - e. 
The arbitrary choice of e implies y £ F, which is the thesis. □ 

APPENDIX A: RESULTS ON EQUIVALENCE RELATIONS 

In this appendix we give the proof of auxiliary results that connect equiv- 
alency with measurability. 

Proof of Lemma 6. Let tt = wf be a discrete equivalency on X. Then 
X/ir defines a countable measurable partition of X. Conversely, let {A±, A2, ■ ■ •} 
be a countable measurable partition on X. Define / : X — > N s.t. f(x) = 
n x G A n . Therefore / is measurable and 7r = ttj is a discrete equiva- 

lency on X. □ 

Lemma A.l. Let f,g be two random variables such that g(x) < g(y) =4> 
f(x)<f(y). Thena(g)Ca(f). 

PROOF. Let t G R be fixed. We must prove that {g < t} G a(f). If 
{9 < t} = or {g > t} are empty, then we are done. Assume then that 
{9 < t}, {9 > t} 7^ 0- We have two cases 

- t* G f({g < t}): 3x* G {g < t} such that t* = f(x*). 

By definition of t*, {g < t} C {/ < £*}. Conversely, let y £ {g > t}. Since 
<?(x*) < t < g(y), then f(x*) = t* < f(y), i.e. {g > t} C {/ > t*}. Then 

{5 <*} = {/<**} Gff(/). 

- /({</ < *}): Vi £ {3 < t} we have that /(x) < t*. 

Then < t} C {/ < t*}. Conversely, let y £ {g > t}. Since Vx G {5 < t} 
5(2/) > then f(y) > f(x), which implies f(y) > sup f({g < t}) = t*, 

i.e. {g>t}Q{f> t*}. Then {g < t} = {/ < t*} G a(f). □ 
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The next lemma plays a central role. Its proof is common in set theory. 

Lemma A. 2. For all n £ N, let n n be a discrete measurable equivalency. 
Then there exists a random variable f such that o~(f) = y n a{X/'K n ). 

Proof. Before proving the core of the Lemma, we build a sequence 
(<7n)ngN of functions g n : N n — > K that will be used to define the function /. 

Take h : N U {0} — > [0, 1) to be the increasing function h(m) = 1 — 2~ m 
and let (<7n)neN the sequence of function g n : N n — ► R so defined: 

5i (mi) = /i(mi - 1) 

g 2 (mi,m 2 ) = fli(mi) + M m 2 - l)A#i(mi) 
g n+ i(m n ,m n+ i) = g n (m n ) + /i(m n+ i)A5( n (m n ) 



where, for all n, m n = (mi, . . . , m n ) and 

Ag n (m n ) = g n (m n _i, m n + 1) - 5 n (m n _i, m n ). 

As a first consequence of the definition, note that for any choice of n and 
m n+ i, it holds that 

(13) 5 n (m n _i,m n ) < ^ n+ i(m n+ i) < # n (m n _i,m n + 1) 

since h € [0, 1). We now prove by induction on n\ + n 2 that for any choice 
of ni G N, n2 G N U {0} and m ni+n2 , we have 

(14) # ni (m ni _i,m n ) < st ni+n2 (m ni+n2 ) < c? ni (m ni _i,m n + 1). 

Eq. (14) is clearly true for n\ + n% = 1, since h is strictly monotone. The 
same argument shows that Eq. (14) is always true for n 2 = and therefore 
we check it only for n 2 > 0. We assume by induction that Eq. (14) is true 
for m + n 2 < n and we prove it for n\ + n 2 = n + 1. By using twice the 
induction hypothesis, as n 2 — 1 > 0, we obtain 

Sr ni (mi,m n ) < 

S , ni+n2-l( m ni+rt2— 2i rn n\+n2 — l) 

2, ^ni+n 2 -l + 1) 

< 5 ni (mi,m n + 1). 
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Eq. (14) is now a consequence of Eq. (13). 

Now, we come back to the proof of the Lemma. First note that, without 
loss of generality, we can (and we do) require the sequence (7r n ) n€ N to be 
monotone, by taking the sequence ir' n = n" =1 7Tj instead of ir n . ir' n is again a 
countable measurable equivalency on X. In fact, by Lemma 6 we can read 
this statement in trivial terms of partitions: since at most countable intersec- 
tion of families of countable measurable partition is a countable measurable 
partition. Moreover, by definition, Vi = xl n cr(X/iTi) = \Zi = il n a(X/ir' i ). 

Let r n = X/n n be the increasing sequence of countable measurable dissec- 
tions of X. We are going to give a consistent inductive method of numbering 
the set of atoms of T n to build the functions /„. Let t\ = {A± ,A^ , ■ ■ •} be 
any ordering of t\. By induction, let {^4^ i , 2 > ■ ■ ■} be the partition of 
the atom A^ n € r n given by T n+ \. Define, for any n G N, 

fnix) = g n (m n ) <^ x e 

To complete the proof, we first show that cr(f n ) = a(X/ir n ), Vn, and then 
we prove a(f) = a (fx, / 2 , . . .) by proving that f n —>f pointwise. 

To prove that cr(f n ) = a(X/ir n ) we show that f n (x) = f n (y) 
3m n : x,y S A)$ n . One implication is a consequence of the fact that f n 
is defined on the partition of X given by Xjix n = r n . For the converse, as- 
sume that x € ^4mi 7^ A^fl 3 y and consider n\ := min-h' < n: rrij / m'A. 
Thus m ni _i = m' ni _x and, without loss of generalities, m ni < m' ni . By 
Eq. (14), we have 

f n (x) = g n {m n ) 

< JmK-lA, + 1) < 9n 1 { m 'n 1 -l: m 'n 1 ) < 9n(m' n ) = f n (y). 



a(f)=a(fx,f % .-.). 

C. The sequence (f n ) n is monotone by definition and bounded by Eq. (14). 
Then 3/: /„ | / and thus a(f) C a(f 1 ,f 2 , ■ ■ ■)■ 

D. Let n be fixed, and take x,y € X with f n (x) < f n (y)- Then, for any 
h > 0, T n C T n+ /j implies rr € ^4m^ 7^ ^4^?/ 3 V- As above, consider 
nx := min{j < n: mj ^ m'A. As / n (x) < /„,(y), we have m ni _i = ! 



G. ALETTI, D. SAADA/TARGET PROBLEMS IN BLACKWELL SPACES 27 



and m ni < m' ni . Again, by Eq. (14), for h > n\ + 1 — n, 

f n +h(x) = g n+h (m n +h) 

< 5ni+i(m ni ,m ni+ i + 1) = a 

< £f ni (m ni _i,m ni + 1) 

< SnK) = fn(y), 

i.e., V7t, f n +h{x) < a < f n (y). As /j | /, f(x) < f(y). Apply Lemma A.l 
with g = f n to conclude that <r(/ n ) C <r(/). □ 

As a consequence of Lemma A. 2, any countably generated sub cr-algebra 
is generated by a measurable equivalence tt, as the following lemma states. 

Lemma A. 3. .4 C X is countably generated if and only if there exists a 
random variable f such that A = o~(f). 

Proof. Let A = cr(A\, A 2 , . . .). Apply Lemma A. 2 with X/ir n = 
{A n ,X\A n }. 

<= Take a countable base Bi,B%,... of £>r and simply note that <r(/) = 
a{{r\B l ),r\B 2 ) 1 ...}). □ 

Proof of Lemma 9. Let x e X be fixed. By hypothesis, Ai C A 2 - If 
Ai = a(A\, A\, . . .) then A 2 will be of the form A 2 = a{A\, A\, A\, A%, . . .). 
Without loss of generality (if needed, by choosing X \ A 3 n instead of A J n ) we 
can require x G A J n , for any n G N and j = 1, 2. Then [x]a 2 = n n (^n A%) C 

The last part of the proof is a consequence of Lemma A. 3 and of the first 
point, since 

r 1 ({f(x)}) = [xl f C[xl g =g-\{g(x)}), 
or, equivalently, f(x) = f(y) g(x) = g(y) which is the thesis. □ 

Proof of Theorem 12. Note that X/tToo C X is countable, generated 
by U n A/-7r n . Then tt^ is a measurable equivalency by Lemma A. 3. 

Conversely, we can use the standard approximation technique: if ir = 7rj 
is measurable, let /„ = 2~ n \_2 n f\ for any n. Since f n are discrete random 
variables, 7r n are defined through Lemma 6. By Lemma 9 and Eq. (3), the 
thesis TTf = n n 7r n will be a consequence of the fact that a(f) = V n cr(/ n ). 

c(/n) f= (T (/) by definition, which implies <r(/i, /2, . . .) C <j(/). Finally, as 
/„ -> /, we have cr(f) C. a(f\, f 2 , . . .), which completes the proof. □ 
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APPENDIX B: PROOF OF THEOREM ?? 
Before proving the theorem, we state the following Lemma. 

Lemma B.l. Let (X, X) be a measurable space. 

1. If any equivalence relationship it on X is measurable, then X = 2 X 
and card(X) < card(K). 

2. The converse is true under the axiom of choice. 

Proof. 1 =^> 2. Let 717 be the identity relation: xiriy x = y. By 

hypothesis, there exists / such that 717 = 717, and thus / is injective. Then 
card(A) < card(M). Now, take A C X and let txa be the relation so defined: 

xnAy <^=> {x, y} C J 4or{i,y}CI \ A. 

Since any equivalency is measurable, then there exists / : (X, X) — ► (R, Br) 
such that tta = 717. But o~(f) = o~(A), which shows that iCI => A G X, 
i.e. X = 2 X . 

2 1. Since card(A) < card(R), there exists an injective function h : 
X — > R. Let 7r be a equivalence relationship on X, and define the following 
equivalence on M: 

ri Rr 2 ({ri,r 2 } C h(X) and h' 1 (n) n h' 1 (r 2 j) or {r u r 2 } C R\h(X) 

By definition of R, if we denote by ttr the canonical projection of M on M./R, 
then 7tr o /i : X — > R/i? is such that 

ttr o h(x) = ttr o h(y) -4=^ xny. 

The axiom of choice ensures the existence of a injective map g : M/i2 — > R. 
Then /":=go7TRo/i:X^IRis such that 7r = 717. / is measurable since 
Af = 2 x . □ 

Proof of Theorem 11. 1 2. By Lemma B.l and Assumption (AO), 
(X, 2 X ) is weakly Blackwell. 

2 3. Assume X is uncountable. By CH, exists Y Q X s.t. Y^ilR (i.e. 
Y is in bijection with IR via gi). Take a bijection kSr\{0}. Then the map 



g{x) 



r 92{gi{x)) ifxGY; 
if x £ X \ Y; 
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is a bijective map from {Y, {X\Y}} to K. Equip K with the Borel cr-algebra 
and let A\ = g~ 1 (B^). A\ is countably generated and its atoms are all 
the points in Y and the set X \ Y. Now, take a non-Borel set N of the 
real line. A2 = g~ 1 (cr(B^, N)) is also countably generated, A\ C A2 and its 
atoms are all the points in Y and the set X \ Y, too. Since Ai C 2 X and 
.4.2 S 2 X , (X, 2 X ) is not a weakly Blackwell space by Lemma 10. 

3 => 1. Since X is countable, then X/tt is. Therefore, Lemma 6 ensures 
any equivalence tt is measurable, since X = 2 . Finally, just note that 
each countable set is strongly Blackwell. And thus Lemma 10 concludes the 
proof. □ 
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